A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions
نویسندگان
چکیده
Abstract. The ergodic control problem for a non-degenerate controlled diffusion controlled through its drift is considered under a uniform stability condition that ensures the well-posedness of the associated Hamilton–Jacobi– Bellman (HJB) equation. A nonlinear parabolic evolution equation is then proposed as a continuous time continuous state space analog of White’s ‘relative value iteration’ algorithm for solving the ergodic dynamic programming equation for the finite state finite action case. Its convergence to the solution of the HJB equation is established using the theory of monotone dynamical systems and also, alternatively, by using the theory of reverse martingales.
منابع مشابه
A Correction to “A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions'' | SIAM Journal on Control and Optimization | Vol. 55, No. 3 | Society for Industrial and Applied Mathematics
In A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions, [SIAM J. Control Optim., 50 (2012), pp. 1886–1902], convergence of the relative value iteration for the ergodic control problem for a nondegenerate diffusion controlled through its drift was established, under the assumption of geometric ergodicity, using two methods: (a) the theory of monotone dynamical systems an...
متن کاملA Correction to "A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions"
In A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions, [SIAM J. Control Optim., 50 (2012), pp. 1886–1902], convergence of the relative value iteration for the ergodic control problem for a nondegenerate diffusion controlled through its drift was established, under the assumption of geometric ergodicity, using two methods: (a) the theory of monotone dynamical systems an...
متن کاملConvergence of the Relative Value Iteration for the Ergodic Control Problem of Nondegenerate Diffusions
We study the relative value iteration for the ergodic control problem under a nearmonotone running cost structure for a nondegenerate diffusion controlled through its drift. This algorithm takes the form of a quasi-linear parabolic Cauchy initial value problem in Rd. We show that this Cauchy problem stabilizes or, in other words, that the solution of the quasi-linear parabolic equation converge...
متن کاملConvergence of The Relative Value Iteration for the Ergodic Control Problem of Nondegenerate Diffusions under Near-Monotone Costs
We study the relative value iteration for the ergodic control problem under a nearmonotone running cost structure for a nondegenerate diffusion controlled through its drift. This algorithm takes the form of a quasilinear parabolic Cauchy initial value problem in R. We show that this Cauchy problem stabilizes, or in other words, that the solution of the quasilinear parabolic equation converges f...
متن کاملOn the Non-Uniqueness of Solutions to the Average Cost HJB for Controlled Diffusions with Near-Monotone Costs
We present a theorem for verification of optimality of controlled diffusions under the average cost criterion with near-monotone running cost, without invoking any blanket stability assumptions. The implications of this result to the policy iteration algorithm are also discussed. Index Terms controlled diffusions, near-monotone costs, Hamilton–Jacobi–Bellman equation, policy iteration
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- SIAM J. Control and Optimization
دوره 50 شماره
صفحات -
تاریخ انتشار 2012